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Abstract 

A (q, k, t)-design matrix is an m x n matrix whose pattern of zeros/non-zeros satisfies the 
following design-like condition: each row has at most q non-zeros, each column has at least 
k non-zeros and the supports of every two columns intersect in at most t rows. We prove 
that for m > n, the rank of any (q, k, i)-design matrix over a field of characteristic zero (or 
sufficiently large finite characteristic) is at least 

qtn 
-2k 

Using this result we derive the following applications: 

Impossibility results for 2-query LCCs over large fields. A 2-query locally correctable 
code (LCC) is an error correcting code in which every codeword coordinate can be re- 
covered, probabilistically, by reading at most two other code positions. Such codes 
have numerous applications and constructions (with exponential encoding length) are 
known over finite fields of small characteristic. We show that infinite families of such 
linear 2-query LCCs do not exist over fields of characteristic zero or large characteristic 
regardless of the encoding length. 

Generalization of known results in combinatorial geometry. We prove a quantita- 
tive analog of the Sylvester-Gallai theorem: Let Vi, . ■ . ,v m be a set of points in C d 
such that for every i £ [m] there exists at least 8m values of j £ [m] such that the 
line through Vi,Vj contains a third point in the set. We show that the dimension of 
{vi, . . . , v m } is at most 0(l/<5 2 ). Our results generalize to the high dimensional case 
(replacing lines with planes, etc.) and to the case where the points are colored (as in 
the Motzkin- Rabin Theorem). 
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1 Introduction 



In this work we study what combinatorial properties of matrices guarantee high algebraic rank, 
where a property is combinatorial if it depends only on the zero/non-zero pattern of the matrix, 
and not on the values of its entries. This question has a rich history in mathematics (see 
Section [L2|) . and some computer science motivations: 

Locally correctable codes. A locally correctable code is an error correcting code in which 
for every codeword y, given a corrupted version y of y and an index i, one can recover 
the correct value of yt from y by looking only at very few coordinates of y. It is an open 
question in coding theory to understand the tradeoffs between the fraction of errors, locality 
(number of coordinates read) and rate (ratio of message length to codeword length) of such 
codes, with very large gaps between the known upper bounds and lower bounds (see the 
survey |Tre04p . The question is open even for linear codes, where the condition of being 
locally correctable turns out to be equivalent to the existence of low weight codewords in 
the dual codewords that are "well-spread" in some precise technical sense (see Section [7]) . 
Because of the relation between the rate of the code and its dual, the question becomes 
equivalent to asking whether this combinatorial "well-spreadness" condition guarantees 
high rank. 

Matrix rigidity. A longstanding question is to come up with an explicit matrix that is rigid 
in the sense that its rank cannot be reduced by changing a small number of its entries. 
Random matrices are extremely rigid, and sufficiently good explicit constructions will yield 
lower bounds for arithmetic circuits |Val77j . though we are still very far from achieving 
this (see the survey [Lok09]). One can hope that a combinatorial property guaranteeing 
large rank will be robust under small perturbations, and hence a matrix satisfying such a 
property will automatically be rigid. 

In both these cases it is crucial to obtain bounds on the rank that depend solely on the 
zero/non-zero pattern of the matrix, without placing any restrictions on the non-zero coefficients. 
For example, there are very strong bounds known for matrix rigidity under the restriction that 
the non-zero coefficients have bounded magnitude (see Chapter 3 in [Lok09]), but they only 
imply lower bounds in a very restricted model. In fact, there is a relation between the two 
questions, and sufficiently good answers for the first question will imply answers for the second 
one [DvilOj . We stress that these two examples are in no way exhaustive. The interplay between 
combinatorial and algebraic properties of matrices is a fascinating question with many potential 
applications that is still very poorly understood. 

1.1 Our Results 

In this work we give a combinatorial property of complex matrices that implies high rank. While 
not strong enough to prove rigidity results, we are able to use it to obtain several applications 
in combinatorial geometry and locally correctable codes. Our main result is the following the- 
orem, giving a lower bound on the rank of matrix whose non-zero pattern forms has certain 
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combinatorial-design like properties in the sense that the sets of non-zero entries in each column 
have small intersections. (This theorem is restated as Theorem 13,21 ) 

Theorem 1 (Rank bound for design matrices). Let m > n. We say that an m x n complex 
matrix A is a (q,k,t)- design matrix if every row of A has at most q non-zero entries, every 
column of A has at least k non-zeroes entries, and the supports of every two columns intersect 
in at most t rows. For every such A, 



We also show that Theorem [TJ and in fact any result connecting the zero/non-zero pattern 
to rank, can be made to hold over arbitrary characteristic zero fields and also over fields of 
sufficiently large (depending on m, n) finite characteristic. 

1.1.1 Applications to Combinatorial Geometry 

Our most immediate applications of Theorem [T] are to questions regarding line-point incidences. 
Results on line-point incidences have recently found use in the area of computational complexity 
in relation to pseudo-randomness [BKT04, BIW06J and de-randomization [KS09, SS1Q]. In this 
setting we have an arrangement of a finite number of points in real or complex space. Every such 
arrangement gives rise to a set of lines, namely, those lines that pass through at least two of the 
points in the arrangement. Information about these lines can be converted, in some cases, into 
information about the dimension of the set of points (i.e. the dimension of the space the points 
span). Our rank theorem can be used to derive generalizations for two well-known theorems in 
this area: the Sylvester- Gallai theorem and the Motzkin-Rabin theorem. 

Generalizing the Sylvester-Gallai Theorem. The Sylvester-Gallai (SG for short) theorem 
says that if m distinct points vi,...,v m G M. d are not collinear, then there exists a line that passes 
through exactly two of them. In its contrapositive form the SG theorem says that if for every 
i j the line through Vi and Vj passes through a third point v^, then dim{?;i, . . . , v m } < 1, 
where dim{t>i, . . . ,v m } is the dimension of the smallest affine subspace containing the points. 
This theorem was first conjectured by Sylvester in 1893 |Syl93|, proved (in dual form) by Melchior 
in 1940 [Mel40j, and then independently conjectured by Erdos in 1943 [Erd43] and proved by 
Gallai in 1944. The SG theorem has several beautiful proofs and many generalizations, see the 
survey [BM90]. Over the complex numbers the (tight) bound on the dimension is 2 instead of 
1. The complex version was first proven by Kelly [Kel86] using a deep results from algebraic 
geometry, and more recently, an elementary proof was found by Elkies, Pretorius and Swanepoel 
[ES06] who also proved it over the quaternions with an upper of 4 on the dimension. 

We say that the points v±, . . . , v m (in R rf or C d ) form a 5-SG configuration if for every i G [m] 
there exists at least 5m values of j G [m] such that the line through Vi,Vj contains a third point 
in the set. Szemeredi and Trotter [ST83 showed that, when 5 is larger than some absolute 
constant close to 1, then the dimension of a 5-SG configuration is at most one (over the reals). 
We show the following generalization of their result to arbitrary 5 > (and over the complex 
numbers). 




3 



Theorem 2 (Quantitative SG theorem). If v\, . . . , v m G C d is a 5-SG configuration then 
dirndl, . . . ,v m } < 13 /5 2 . 

We note that one cannot replace the bound 13/<5 2 of Theorem [2] with 1 or even with any 
fixed constant, as one can easily create a <5-SG configuration of dimension roughly 2/5 by placing 
the points on 1/5 lines. This is analogous to error correcting codes, where once the fraction 5 
of agreement between the original and corrupted codeword drops below half there can be no 
unique decoding. In that sense our result can be thought of as a list decoding variant of the SG 
theorem, whereas the result of [ST83] is its unique decoding variant. We also show an "average 
case" version of the SG theorem, proving a bound on the dimension of a large subset of the 
points under the assumption that there are many collinear triples (see Theorem 14.8ft . 

We also prove a version of Theorem 14.31 with lines replaced by A;- flats (fc-dimensional affine 
subspaces). This generalizes a theorem of Hansen [Han65, BE67] which deals with the case 
a = 1. The statement of this result is technical and so we give it in Section [5] where it is also 
proven. 

Since our proofs use elementary (and purely algebraic) reductions to the rank theorem, they 
hold over arbitrary fields of characteristic zero or of sufficiently large finite characteristic. This is 
in contrast to many of the known proofs of such theorems which often rely on specific properties 
of the real (or complex) numbers. However, we currently do not recover the full version of the 
original SG theorem, in the sense that even for 5 = 1 we do not get a bound of 1 (or 2 for 
complex numbers) on the dimension. (However, the term 13/ 5 2 can be improved a bit in the 
5 = 1 case to obtain a bound of 9 on the dimension.) 

Generalizing the Motzkin-Rabin Theorem. The Motzkin-Rabin (MR for short) theorem 
(see e.g. [BM90J) is an interesting variant of the Sylvester-Gallai theorem that states that if 
points vi, . . . , v m £ M. d are colored either red or blue and there is no monochromatic line passing 
through at least two points, then they are all collinear. As in the SG theorem, we obtain a 
quantitative generalization of the MR theorem such that (letting b and r be the numbers of blue 
and red points respectively), if for every blue (resp. red) point v, there are 5b blue (resp. 5r 
red) points v' where the line through v and v' passes through a red (resp. blue) point, then 
dim{?;i, . . . ,v m } < 0(1/ 5 4 ). We also prove a three colors variant of the MR theorem, showing 
that if vi,...,v m are colored red, blue and green, and all lines are not monochromatic, then 
dim{t>i, . . . ,v m } is at most some absolute constant. 

1.1.2 Locally Correctable Codes 

A (linear) q query locally correctable code ((q, 5)-LCC for short) over a field F is a subspace 
CCF" such that, given an element y that disagrees with some y S C in at most 5n positions and 
an index i 6 [n], one can recover yi with, say, probability 0.9, by reading at most q coordinates 
of y. Over the field of two elements F2 the standard Hadamard code construction yields a (2,5)- 
query LCC with dimension 0(log(n)) for constant 5 > (see the survey |Tre04j ). In contrast 
we show that for every constant 5 > there do not exist infinite family of such codes over the 
complex numbers: 
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Theorem 3 (Impossibility of 2-query LCCs over C). If C is a 2-query LCC for 8 fraction of 
errors over C, then dim(C) < 0(l/5 9 ). 

We note that the Hadamard construction does yield a locally decodable code over the com- 
plex numbers with dimension O(logn). Locally decodable codes are the relaxation of a locally 
correctable codes where one only needs to be able to recover the coordinates of the original 
message as opposed to the codeword. Thus over the complex numbers, there is a very strong 
separation between the notions of locally decodable and locally correctable codes, whereas it is 
consistent with our knowledge that for, say, F2 the rate/locality tradeoffs of both notions are 
the same. 

1.2 Related Work 

The idea to use matrix scaling to study structural properties of matrices was already present in 
[CPRQO]. This work, which was also motivated by the problem of matrix rigidity, studies the 
presence of short cycles in the graphs of non-zero entries of a square matrix. 

A related line of work on the rank of 'design' matrices is the work emerging from Hamada's 
conjecture |Ham73] . (See [JT09| for a recent result and more references.) Here, a design matrix 
is defined using stricter conditions (each row/column has exactly the same number of non-zeros 
and the intersections are also all of the same size) which are more common in the literature 
dealing with combinatorial designs. In order to be completely consistent with this line of work 
we should have called our matrices 'approximate-design' matrices. We chose to use the (already 
overused) word 'design' to make the presentation more readable. We also note that considering 
approximate designs only makes our results stronger. Hamada's conjecture states that of all 
zero/one matrices whose support comes from a design (in the stricter sense), the minimal rank 
is obtained by matrices coming from geometric designs (in our language, Reed-Muller codes). 
In contrast to this paper, the emphasis in this line of works is typically on small finite fields. We 
note here that the connection between Hamada's conjecture and LCCs was already observed by 
Barkol, Ishai and Weinreb [BIW07J who also conjectured (over small fields) the 'approximate- 
design' versions which we prove here for large fields. 

Another place where the support of a matrix is connected to its rank is in graph theory 
where we are interested in minimizing the rank of a (square, symmetric) real matrix which has 
the same support as the adjacency matrix of a given graph. This line of work goes back for over 
fifty years and has many applications in graph theory. See [FH07] for a recent survey on this 
topic. 

Over the reals we can also ask about the minimal rank of matrices with certain sign-pattern. 
That is, given a matrix over {1,-1}, what is the minimal rank of a matrix which has the 
same sign-pattern. This minimal rank is called the sign-rank of a matrix. The question of 
coming up with (combinatorial or otherwise) properties that imply high sign-rank is one of 
major importance and has strong connections to communication complexity, learning theory 
and circuit complexity, among others. For a recent work with plenty of references see |RS08j . In 
particular we would like to mention a connection to the work of Forster l bi ()2 on the sign-rank 
of the Hadamard matrix. (An earlier version of this work used a variant |Bar98} IHarlOj of a 
lemma from [For02] instead of the results of [RS89J on matrix scaling to obtain our main result.) 
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1.3 Organization 



In Section [2] we give a high level overview of our techniques. In Section [3] we prove our main 
result on the rank of design matrices. In Section [J] we prove our quantitative variants of the 
Sylvester- Gallai theorem. In Section [5] we prove the high-dimensional analog of Theorem 14.31 
where lines are replaced with flats. In Section [6] we prove our generalizations of the Motzkin- 
Rabin theorem. In Section [7] we prove our results on locally correctable codes. In Section [8] we 
show how our results extend to other fields. We conclude in Section [9] with a discussion of open 
problems. 

2 Our Techniques 

We now give high-level proof overviews for some of our results. 
2.1 Rank Lower Bounds for Design Matrices 

Theorem Q]- the rank lower bound for design matrices - is proved in two steps. We now sketch 
the proof, ignoring some subtleties and optimizations. The proof starts with the observation 
that, as in the case of matrix rigidity and similar questions, the result is much easier to prove 
given a bound on the magnitude of the non-zero entries. Indeed, if A is a (q, k, i)-design matrix 
and all of its non-zero entries have absolute value in [1 /c, 1] for some constant c, then the n x n 
matrix M = A* A is diagonally dominant, in the sense that for all i ^ j, ma > k/c 2 but 
\ m ij\ < t. (Here A* denotes the conjugate transpose of A.) Thus one can use known results on 
such matrices (e.g. [Alo09j ) to argue that rank(yl) > rank(M) > n — (ntc 2 /k) 2 . Our main idea 
is to reduce to this case where the non-zero coefficients of A are (roughly) bounded using matrix 
scaling. 

A scaling A of a matrix A is obtained by multiplying for all i,j, the z'th row of A by some 
positive number pi and the j'th column of A by some positive number 7^. Clearly, A and A share 
the same rank and zero/non-zero pattern. We use known matrix-scaling results [Sin64, RS89 
to show that every (q, k, i)-design matrix A has a scaling in which every entry has magnitude at 
most (roughly) 1 but its columns have norm at least (roughly) ^Jkjq. We note that the typical 
application of matrix-scaling was with respect to the £i-norm of the rows and columns. Here we 
take a different path: We use scaling with respect to ^-norm. 

We defer the description of this step to Section [3] but the high level idea is to use a theorem 
of |RS89j that shows that such a scaling exists (in fact without the dependence on q) if A had 
the property of not containing any large all-zero sub-matrix. While this property cannot be in 
general guaranteed, we show that by repeating some rows of A one can obtain a matrix B that 
has this property, and a scaling of B can be converted into a scaling of A. Since our lower bound 
on the entry ma in the bounded coefficient case (where again M = ^4*^4) only used the fact that 
the columns have large norms, we can use the same argument as above to lower bound the rank 
of M, and hence of A. 
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2.2 Generalized Sylvester-Gallai Theorem 

Recall that the quantitative SG theorem (Theorem [2]) states that every 5-SG configuration 
vi, . . . , v n , has dimension at most 13/<5 2 . Our proof of Theorem [2] uses Theorem Q] as follows. 
Suppose for starters that every one of these lines passed through exactly three points. Each 
such line induces an equation of the form avi + (3vj + jVk = 0. Now for m = 5n 2 , let A be the 
m x n matrix whose rows correspond to these equations. Since every two points participate in 
only one line, A will be a (3, 5n, 1) design matrix, meaning that according to Theorem dj ^4's 
rank is at least n — (£A . Since A times the matrix whose rows are v\, . . . , v n is zero we have 
dim{ui, . . . ,v n } < n — rank(^4). We thus get an upper bound of [9/A\ = 2 on this dimension. 
To handle the case when some lines contain more than three points, we choose in some careful 
way from each line £ containing r points a subset of the (g) equations of the form above that it 
induces on its points. We show that at some small loss in the parameters we can still ensure the 
set of equations forms a design, hence again deriving a lower bound on its rank via Theorem [TJ 

Our method extend also to an "average case" SG theorem (Theorem 14. 8p . where one only 
requires that the set of points supports many (i.e., fi(n 2 )) collinear triples and that each pair 
of points appear together in a few collinear triples. In this case we are able to show that there 
is a subset of Q(n) points whose span has dimension O(l). See Section [4] for more details. Our 
generalizations of the Motzkin-Rabin theorem follow from our theorem on 5-SG configurations 
via simple reductions (see Section [6]) . 

2.3 Locally Correctable Codes 

At first sight, Theorem [3] - non existence of 2 query locally correctable codes over C - seems 
like it should be an immediate corollary of Theorem [21 Suppose that a code C maps C d to 
C n , and let vi,...,v n denote the rows of its generating matrix. That is, the code maps a 
message x G to the vector ((vi,x),... ,(v n ,x)). The fact that C is a 2 query LCC for 5 
errors implies that for every such row Vi, there are roughly 5n pairs j,k such that Vi is in the 
span of {vj,Vf.}. Using some simple scaling/change of basis, this gives precisely the condition 
of being a 5-SG configuration, save for one caveat: In a code there is no guarantee that all 
the vectors vi, . . . ,v n are distinct. That is, the code may have repeated coordinates that are 
always identical. Intuitively it seems that such repetitions should not help at all in constructing 
LCCs but proving this turned out to be elusive. In fact, our proof of Theorem [3] is rather more 
complicated than the proof Theorem [21 involving repeated applications of Theorem [T] which 
result also in somewhat poorer quantitative bounds. The idea behind the proof to use a variant 
of the "average case" SG theorem to repeatedly find Q(n) points among v\,. . . ,v n whose span 
has O(l) dimension, until there are no more points left. We defer all details to Section [71 

Given Theorem [H one may have expected that Theorem [3] could be extended for LCCs of 
any constant number q of queries. After all, the condition of C being an LCC intuitively seems 
like only a slight relaxation of requiring that the dual code of C has a generating matrix whose 
non-zero pattern is a combinatorial design, and indeed in known constructions of LCCs, the 
dual code does form a design. We are not, however, able to extend our results to 3 and more 
queries. A partial explanation to our inability is that 3 query LCCs give rise to configuration 
of planes (instead of lines) and point and planes exhibit much more complicated combinatorial 
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properties than lines. 



3 Rank of Design Matrices 

In this section we prove our main result which gives a lower bound on the rank of matrices 
whose zero/non-zero pattern satisfies certain properties. We start by defining these properties 
formally. 

Definition 3.1 (Design matrix). Let A be an m x n matrix over some field. For % G [m] let 
Ri C [n] denote the set of indices of all non-zero entries in the i'th row of A. Similarly, let 
Cj C [m], j G [n], denote the set of non-zero indices in the j'th column. We say that A is a 
(g, k, t) -design matrix if 

1. For all i G [m], \Ri\ < q. 

2. For all j G [n], \Cj\> k. 

3. For all ji ^ j 2 G [n], \C h n C h \ < t. 

Theorem 3.2 (Restatement of Theorem [1] - rank of design matrices). Let A be an m x n 

complex matrix. If A is a (q,k,t)- design matrix then 



Remark 3.3. The proof of the theorem actually holds under a slightly weaker condition on 
the sizes of the intersections. Instead of requiring that \Cj 1 n Cj 2 \ < t for all pairs of columns 
31 7^ 32 , it is enough to ask that 



That is, there could be some pairs with large intersection as long as the average of the squares 
is not too large. 

The proof of the theorem is given below, following some preliminaries. 
3.1 Preliminaries for the Proof of Theorem 13.21 

Notation: For a set of real vectors V G C n we denote by rank(V) the dimension of the vector 
space spanned by elements of V. We denote the ^ 2 - n orm of a vector v by \\v\\. We denote by I n 
the n x n identity matrix. 

We start with definitions and results on matrix scaling. 

Definition 3.4. [Matrix scaling] Let A be an m x n complex matrix. Let p G C m ,7 G C" be 
two complex vectors with all entries non-zero. We denote by 




\C h C\C n \ 2 <n 2 -t 2 . 



SC(AP,7) 
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the matrix obtained from A by multiplying the (i, j)'th element of A by pi ■ 7^. We say that two 
matrices A, B of the same dimensions are a scaling of each other if there exist non-zero vectors 
p, 7 such that B = SC(A, p, 7). It is easy to check that this is an equivalence relation. We refer 
to the elements of the vector p as the row scaling coefficients and to the elements of 7 as the 
column scaling coefficients. Notice that two matrices which are a scaling of each other have the 
same rank and the same pattern of zero and non-zero entries. 

Matrix scaling originated in a paper of Sinkhorn [Sin64] and has been widely studied since 
(sec [LSWOOj for more background). The following is a special case of a theorem from [RS89 
that gives sufficient conditions for finding a scaling of a matrix which has certain row and column 
sums. 

Definition 3.5 (Property- S). Let ibeanmxn matrix over some field. We say that A satisfies 
Property-S if for every zero sub-matrix of A of size a x b it holds that 

^<i. (i) 

m n 

Theorem 3.6 (Matrix scaling theorem, Theorem 3 in [RS89J ). Let A be an m x n real matrix 
with non-negative entries which satisfies Property-S . Then, for every e > 0, there exists a scaling 
A' of A such that the sum of each row of A' is at most 1 + e and the sum of each column of 
A' is at least m/n — e. Moreover, the scaling coefficients used to obtain A' are all positive real 
numbers. 

The proof of the theorem is algorithmic [Sin64]: Start by normalizing A's rows to have sum 
1, then normalize A's columns to have sum m/n, then go back to normalizing the rows the have 
sum 1, and so forth. It can be shown (using a suitable potential function) that this process 
eventually transforms A to the claimed form (since A has Property-^). 

We will use the following easy corollary of the above theorem. 

Corollary 3.7 (-^-scaling). Let A = (a^-) be anmxn complex matrix which satisfies Property-S . 
Then, for every e > 0, there exists a scaling A' of A such that for every i e [m] 

\aij\ 2 > m/n — e. 

i£[m] 

Proof. Let B = (fey) = (la^l 2 ). Then B is a real non-negative matrix satisfying Property-S. 
Applying Theorem 13.61 we get that for all e > there exists a scaling B' = SC(B, p, 7), with p, 7 
positive real vectors, which has row sums at most 1+e and column sums at least m/n — e. Letting 
Pi = y/Pi an d l'i = x/li we get a scaling SC(A, p' , 7') of A with the required properties. □ 

We will use a variant of a well known lemma (see for example [Alo09j ) which provides a 
bound on the rank of matrices whose diagonal entries are much larger than the off-diagonal 
ones. 



and for every j E [n] 
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Lemma 3.8. Let A = (ay) be annxn complex hermitian matrix and let < I < L be integers. 
Suppose that an > L for all i £ [n] and that |ay| < £ for all i ^ j. Then 

rank(A)> i + n n m)2 >n-(nl/Lf. 

Proof. We can assume w.l.o.g. that an = L for all i. If not, then we can make the inequality 
into an equality by multiplying the i'th row and column by (L/an) 1 ^ 2 < 1 without changing 
the rank or breaking the symmetry. Let r = rank(A) and let \\, . . . ,X r denote the non-zero 
eigenvalues of A (counting multiplicities). Since A is hermitian we have that the Aj's are real. 
We have 

n 2 -L 2 = tv(Af=(±x) <r-±X 2 

\i=l J i=l 

< r ■ (n- L 2 +n 2 ■ I 2 ). 

Rearranging we get the required bound. The second inequality in the statement of the lemma 
follows from the fact that 1/(1 + x) > 1 — x for all x. □ 



n 



3.2 Proof of Theorem EE2I 

To prove the theorem we will first find a scaling of A so that the norms (squared) of the columns 
are large and such that each entry is small. 

Our first step is to find annkxn matrix B that will satisfy Property-,? and will be composed 
from rows of A s.t. each row is repeated with multiplicity between and q. To achieve this we 
will describe an algorithm that builds the matrix B iteratively by concatenating to it rows from 
A. The algorithm will mark entries of A as it continues to add rows. Keeping track of these 
marks will help us decide which rows to add next. Initially all the entries of A are unmarked. 
The algorithm proceeds in k steps. At step i (i goes from 1 to k) the algorithm picks n rows 
from A and adds them to B. These n rows are chosen as follows: For every j G {1, . . . , n} pick 
a row that has an unmarked non-zero entry in the j'th column and mark this non-zero entry. 
The reason why such a row exists at all steps is that each column contains at least k non-zero 
entries, and in each step we mark at most one non-zero entry in each column. 

Claim 3.9. The matrix B obtained by the algorithm has Property-S and each row of A is added 
to B at most q times. 

Proof. The n rows added at each of the k steps form an n x n matrix with non-zero diagonal. 
Thus they satisfy Property-S". It is an easy exercise to verify that a concatenation of matrices 
with Property-S* also has this property. The bound on the number of times each row is added 
to B follows from the fact that each row has at most q non-zero entries and each time we add 
a row to B we mark one of its non-zero entries. □ 

Our next step is to obtain a scaling of B and, from it, a scaling of A. Fix some e > (which 
will later tend to zero) . Applying Corollary 13.71 we get a scaling B' of B such that the ^2-norm 
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of each row is at most \/l + e and the ^-norm of each column is at least nk/n — e = \/k — e. 
We now obtain a scaling A' of A as follows: The scaling of the columns are the same as for B'. 
For the rows of A appearing in B we take the maximal scaling coefficient used for these rows in 
B', that is, if row i in A appears as rows ii, iit ■ ■ ■ i V i n B, then the scaling coefficient of row i 
in A' is the maximal scaling coefficient of rows ii,i2, ■ ■ ■ , V in B'. For rows not in B, we pick 
scaling coefficients so that their £2 norm (in the final scaling) is equal to 1. 



Claim 3.10. The matrix A' is a scaling of A such that each row has l2-norm at most \/l + e 
and each column has l2-norm at least \/ (k — e)/q. 

Proof. The fact that the row norms are at most yjl + e is trivial. To argue about the column 
norms observe that a column of B' is obtained from repeating each non-zero element in the 
corresponding column of A' at most q times (together with some zeros). Therefore, if we denote 
by ci, . . . , c s the non-zero entries in some column of A', we have that 

s 

■ M 2 >k-e, 

i=i 

where the mj's are integers between and q. In this last inequality we also relied on the fact 
that we chose the maximal row scaling coefficient among all those that correspond to the same 
row in A. Therefore, 



s 

El- 12 

i=i 



\ci\ >{k- e)/q, 

as required. □ 



Our final step is to argue about the rank of A' (which is the same as the rank of ^4). To this 
end, consider the matrix 

M = (A')* ■ A', 

where (^4/)* is A 1 transposed conjugate. Then M = (rriij) is an n x n hermitian matrix. The 
diagonal entries of M are exactly the squares of the ^-norm of the columns of A' . Therefore, 

ma > (k - e)/q 

for all i € [n\. 

We now upper bound the off-diagonal entries. The off-diagonal entries of M are the inner 
products of different columns of A' . The intersection of the support of each pair of different 
columns is at most t. The norm of each row is at most \/l + e. For every two real numbers a, f3 
so that a 2 + f3 2 < 1 + e we have \a ■ j3\ < 1/2 + e', where e' tends to zero as e tends to zero. 
Therefore 



rn 



<t-(l/2 + e') 



for all i ^ j E [n] . Applying Lemma 13.81 we get that 

V*(l/2 + e') -n x 2 



rank(^4) = rank(yl') > n 



A: 



Since this holds for all e > it holds also for e = 0, which gives the required bound on the rank 
of A □ 
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4 Sylvester- Gallai Configurations 



In this section we prove the quantitative Sylvester-Gallai (SG) Theorem. We will be interested 
with point configurations in real and complex space. These are finite sets of distinct points 
v\, . . . ,v n in M. d or C d . The dimension of a configuration is defined to be the dimension of the 
smallest affine subspace containing all points. 

Definition 4.1 (Special and ordinary lines). Let v\, . . . , v n G C d be a set of n distinct points 
in d-dimensional complex space. A line t passing through at least three of these points is called 
a special line. A line passing through exactly two points is called an ordinary line. 

Definition 4.2 (<5-SG configuration). Let 5 G [0, 1]. A set of n distinct points v\, . . . ,v n G C d 
is called a 5-SG configuration if for every i 6 [n], there exists a family of special lines Li all 
passing through V{ and at least 5n of the points vx,...,v n are on the lines in L{. (Note that 
each collection Lj may cover a different subset of the n points.) 

The main result of this section bounds the dimension of 5-SG configurations for all 5 > 0. 
Since we can always satisfy the definition by spreading the points evenly over 1/5 lines we know 
that the dimension can be at least 2/5 (and in fact in complex space at least 3/5). We prove an 
upper bound of 0(l/5 2 ). 

Theorem 4.3 (Restatement of Theorem [2] - quantitative SG theorem). Let 5 G (0,1]. Let 
v±, . . . , v n G C d be a 5-SG configuration. Then 



Moreover, the dimension of a 1-SG configuration is at most 10. 

The constants in the proof have been optimized to the best of our abilities. Notice that in 
the above theorem 5 can be dependant on n. For example, a (l/log(n))-SG configuration of n 



4.1 Preliminaries to the Proof of Theorem 14.31 

The notion of a latin square will turn out useful in the proof: 

Definition 4.4 (Latin squares). An r x r latin square is an r x r matrix D such that Dij G [r] 
for all i,j and every number in [r] appears exactly once in each row and in each column. A latin 
square D is called diagonal if Di^ = i for all i G [r] . 

Theorem 4.5 ( [Hil73j ) . For every r > 3 there exists a diagonal r x r latin square. 

We note that we use diagonal latin squares only to optimize constant factors. If one does 
not care about such factors then there is a simple construction that serves the same goal. 

The following lemma is an easy consequence of the above theorem. 

Lemma 4.6. Let r > 3. Then there exists a set T C [r] 3 of r 2 — r triples that satisfies the 
following properties: 



dim{v\, . . . , v n } < 13/<5 2 . 
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1. Each triple {t\,t2,t^) £ T is of three distinct elements. 

2. For each i £ [r] there are exactly 3(r — 1) triples in T containing i as an element. 

3. For every pair i,j £ [r] of distinct elements there are at most 6 triples in T which contain 
both i and j as elements. 

Proof. Let D be an r x r diagonal latin square which we know exists from Theorem 14.51 Define 
T C [r] 3 to be the set of all triples k) £ [r] 3 with i ^ j such that Dij = k. The number of 
such triples is r 2 — r. Property 1 holds by the definition of diagonal latin square — we cannot 
have Dij = i for j/i since Dii = i and every row in D has distinct as the (i, i) entry in D is 
labeled i for all i £ [r], and similarly we cannot have Dij = j for i ^ j. 

Let i £ [r]. By construction, there are r — 1 triples in T which have i as their first entry, and 
r — 1 triples that have i as their second entry. There are also r — 1 triples in T which have i as 
their last entry, since for every one of the r — 1 rows i' ^ i there is exactly one location j' ^ %' in 
which the label i appears, and that contributes the triple (i',j',i) to T. This proves Property 2. 

To prove Property 3 observe that two triples in T can agree in at most one place. For example, 
knowing the row and column determines the label, knowing the row and label determines the 
column, and so forth. Therefore, a pair cannot appear in more than 6 triples since otherwise 
there would have been at least two triples with i,j at the same places, and these triples would 
violate the above rule. □ 

4.2 Proof of Theorem M 

Let V be the n x d matrix whose i'th row is the vector «j. Assume w.l.o.g. that v\ = 0. Thus 

dim{ui, . . . , v n } = rank(V). 

The overview of the proof is as follows. We will first build an m x n matrix A that will 
satisfy A ■ V = 0. Then, we will argue that the rank of A is large because it is a design matrix. 
This will show that the rank of V is small. 

Consider a special line £ which passes through three points Vi,Vj,Vk- This gives a linear 
dependency among the three vectors Vi,Vj,Vk (we identify a point with its vector of coordinates 
in the standard basis). In other words, this gives a vector a = (a\, . . . , a n ) which is non-zero only 
in the three coordinates i,j, k and such that a ■ V = 0. If a is not unique, choose an arbitrary 
vector a with these properties. 

Our strategy is to pick a family of collinear triples among the points in our configuration 
and to build the matrix A from rows corresponding to these triples in the above manner. 

Let C denote the set of all special lines in the configuration (i.e. all lines containing at least 
three points). Then each Lj is a subset of C containing lines passing through V{. For each t £ C 
let Vi denote the set of points in the configuration which lie on the line I. Then | V$\ > 3 and we 
can assign to it a family of triples Ti C Vf ', given by Lemma 14.61 (we identify Vi with [r], where 
r = \Vi\ in some arbitrary way). 
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We now construct the matrix A by going over all lines £ £ C and for each triple in Tg adding 
as a row of A the vector with three non-zero coefficients a = (a±, . . . ,a n ) described above (so 
that a is the linear dependency between the three points in the triple). 

Since the matrix A satisfies A ■ V = by construction, we only have to argue that A is a 
design matrix and bound its rank. 

Claim 4.7. The matrix A is a (3, 3k, 6) -design matrix, where k = \5n\ — 1. 

Proof. By construction, each row of A has exactly 3 non-zero entries. The number of non-zero 
entries in column i of A corresponds to the number of triples we used that contain the point Vi. 
These can come from all special lines containing V{. Suppose there are s special lines containing 
Vi and let r±, . . . ,r s denote the number of points on each of those lines. Then, since the lines 
through Vi have only the point Vi in common, we have that 



The properties of the families of triples Ti guarantee that there are 3(rj — 1) triples containing 
Vi coming from the j'th line. Therefore there are at least 3k triples in total containing Vi. 

The size of the intersection of columns i\ and ii is equal to the number of triples containing 
the points Vi l ,Vi 2 that were used in the construction of A. These triples can only come from one 
special line (the line containing these two points) and so, by Lemma 14.6} there can be at most 
6 of those. □ 

Applying Theorem 13.21 we get that 



where the third inequality holds as 5n > 13 since otherwise the theorem trivially holds. Since 
A ■ V = we have that 



4.3 Average-Case Version 

In this section we use Theorem 14.31 to argue about the case where we only know that there are 
many collinear triples in a configuration. 



s 



E( r i - l )> k - 





□ 
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Theorem 4.8 (Average-case SG theorem). Let V = {vi, . . . ,v rn } C C d be a set of m distinct 
points. Let T be the set of (unordered) collinear triples in V . Suppose \T\ > am 2 and that every 
two points v,v' in V appear in at most c triples in T, then there exists a subset V C V such 
that \V'\ > am/ (2c) and dim(V') < 0(l/a 2 ). 

Notice that the bound on the number of triples containing a fixed pair of points is necessary 
for the theorem to hold. If we remove this assumption than we could create a counter-example 
by arranging the points so that m 2 / 3 of them are on a line and the rest span the entire space. 

Lemma 4.9. Let H be a 3-regular hypergraph with vertex set [m] and am 2 edges of co-degree 
at most c (i.e. for every i ^ j in [m], the set {i,j} is contained in at most c edges). Then there 
is a subset M C [m] of size \M\ > am/ (2c) so that the minimal degree of the sub-graph of H 
induced by M is at least am/2. 

Proof. We describe an iterative process to find M. We start with M = [m\. While there exists 
a vertex of degree less than am/2, remove this vertex from M and remove all edges containing 
this vertex from H. Continuing in this fashion we conclude with a set M such that every point 
in M has degree at least am/2. This process removed in total at most m ■ am/2 edges and 
thus the new H still contains at least am 2 /2 edges. As the co-degree is at most c, every vertex 
appears in at most cm edges. Thus, the size of M is of size at least am/ (2c). □ 

Proof of Theorem \4-8\ The family of triples T defines a 3-regular hypergraph on V of co-degree 
at most c. Lemma 14.91 thus implies that there is a subset V' C V of size \V'\ > am/ (2c) that is 
an (a/2)-SG configuration. By Theorem 14.31 V' has dimension at most 0(l/a 2 ). □ 

5 Robust SG Theorem for /c-Flats 

In this section we prove two high-dimensional analogs of the SG theorem. Let &(vi, . . . , 
(fl for 'flat') denote the affine span of k points (i.e. the points that can be written as linear 
combinations with coefficients that sum to one) . We call v\ , . . . , Vf. independent if their flat is 
of dimension k — 1 (dimension means affine dimension) , and say that vi, . . . ,Vk are dependent 
otherwise. A k-flat is an affine subspace of dimension k. 

In the following V is a set of n distinct points in complex space C d . A A;-flat is called ordinary 
if its intersection with V is contained in the union of a (k — l)-flat and a single point. A fc-flat 
is elementary if its intersection with V has exactly k + 1 points. Notice that for k = 1 (lines) 
the two notions of ordinary and elementary coincide. 

For dimensions higher than one, there are two different definitions that generalize that of SG 
configuration. The first definition is based on ordinary fc-flats (though in a slightly stronger way 
which will be more useful in the proofs to come). The second definition (which is less restricted 
than the first one) uses elementary k- flats. 

Definition 5.1. The set V is a 5-SG£ configuration if for every independent vi, ■ ■ ■ y Vf, G V 
there are at least 5n points u G V s.t. either u G fl(i>i, • • • , i>fe) or the fc-flat fl(«i, . . . , u&, u) 
contains a point w outside fl(fi, . . . , Vk) U {u}. 
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Definition 5.2. The set V is a configuration if for every independent vi,...,Vk G V 

there are at least 5n points u G V s.t. either u G fl(t>i, . . . , Vfe) or the fc-flat fl(t>i, . . . ,Vk,u) is 
not elementary. 

Both definitions coincide with that of SG configuration when fe = 1: Indeed, fl(t>i) = t>i 
and fl(vi,n) is the line through V\,u. Therefore, u is never in fl(i>i) and the line R(vi,u) is not 
elementary iff it contains at least one point w {vi,u}. 

We prove two high-dimensional versions of the SG theorem, each corresponding to one of the 
definitions above. The first uses the more restricted 'star' definition and gives a strong upper 
bound on dimension. The second uses the less restricted definition and gives a weaker bound 
on dimension. 

Theorem 5.3. Let V be a S-SGf, configuration. Then dim(V) < f(5,k) with 

f(5,k) = 0{(k/5) 2 ). 
Theorem 5.4. Let V be a 5-SGk configuration. Then dim(V) < g(6, k) with 

g(5,k)=2 ck /5 2 

with C > 1 a universal constant. 

The proofs of the two theorems are below. Theorem 15.31 follows by an appropriate induction 
on the dimension, using the (one-dimensional) robust SG theorem. Theorem 15.41 follows by 
reduction to Theorem 15.31 

Before proving the theorems we set some notations. Fix some point vq G V. By a normaliza- 
tion w.r.t. Vq we mean an affine transformation N : C d \— > C d which first moves vq to zero, then 
picks a hyperplane H s.t. no point in V (after the shift) is parallel to H (i.e has inner product 
zero with the orthogonal vector to H) and finally multiplies each point (other than zero) by a 
constant s.t. it is in H. 

Claim 5.5. For such a mapping N we have that Vq, . . . , are dependent iff N{v\), . . . , N(vk) 
are dependent. 

Proof. Since translation and scaling does not affect dependence, w.l.o.g. we assume that vq = 
and that the distance of the hyperplane H from zero is one. Let h be the unit vector orthogonal 
to H. For all i G [k] we have N(vi) = Vi/(vi, h). Assume that Vq, v\, . . . , Vk are dependent, that 
is, w.l.o.g. i?£ = Ylie[k-i\ a i v i f° r some a l> • • • > Q-k—l' For all i G [k—1] define 6j = a{{vi, h). 
Thus N(v k ) = J2ie[k-i] a i v i/( v k, h) = Eie[fe-i] b i N ( v i) where Y,ie[k-i] b i = whi ch means that 
N(vi), . . . , N(vk) are dependent. Since the map a,{ \-± hi is invertible, the other direction of the 
claim holds as well. □ 

We first prove the theorem for 5-SGt configurations. 

Proof of Theorem \5.3[ The proof is by induction on k. For k = 1 we know f(8, 1) < c5~ 2 with 
c > 1 a universal constant. Suppose k > 1. We separate into two cases. The first case is when 
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V is an (<5/(2fc))-SGi configuration and we are done using the bound on k = 1. In the other 
case there is some point vq G V s.t. the size of the set of points on special lines through vq is at 
most 5/ (2k) (a line is special if it contains at least three points). Let S denote the set of points 
on special lines through vq. Thus |S| < 5n/(2k). Let N : C d i— > C d be a normalization w.r.t. vq. 
Notice that for points v S the image N(v) determines v. Similarly, all points on some special 
line map to the same point via N. 

Our goal is to show that V = N(V \ {v }) is a ((1 - lJ(2k))8)-SG* k _ t configuration (after 
eliminating multiplicities from V). This will complete the proof since dim(y) < dim(V') + 1. 
Indeed, if this is the case we have 

f(5, k) < max{4c(k/5) 2 J((l - l/(2fc))5, k - 1) + 1}. 

and by induction we have f(5, k) < 4c(k/5) 2 . 

Fix v[, . . . , v' k _i G V to be k— 1 independent points (if no such tuple exists then V is trivially 
a configuration). Let V\, . . . , Vk~\ G V be points s.t. N(yi) = v[ for i G [k — 1]. Claim 1531 implies 
that vq,vi,... , Vk-i are independent. Thus, there is a set U C V of size at least 5n s.t. for every 
u £ U either u G fl(uo, fi, . . . , ffc-i) or the Axflat fl(wo, • • • , Ufe-i, contains a point u> outside 
fl(uo,«l, • • • ,Wfe-l) U {u}. 

Let = U \ S so that is invertible on f7 and 

> - \S\ > (1 - l/(2Jb))5n. 

Suppose u £ U and let = N(u). By Claim [531 if u G fl(uo, fi, . . . , v ^-i) then v! is in 
fl(ui, • • • , v'k-i)- Otherwise, fl(uo, v i, . . . , Vk-i,u) contains a point u; outside fl(fo, Ui, ■ ■ ■ , Wfc-i) U 
{n}. Let u/ = N(w). We will show that u;' is (a) contained in the (A; — l)-flat Q.(v[, . . . , u') 
and (b) is outside fl(w' 1 , . . . , t'^_ 1 )U{u / }. Property (a) follows from Claim [5"31 since vq, v\, . . . , v^-i, 
u, w are dependent and so i^, . . . , v' k _ v u\ w' are also dependent. To show (b) observe first that 
by Claim 15.51 the points v[, . . . , v' k _ 1 ,u' are independent (since Vq, v%, . . . , v k -i-,u are indepen- 
dent) and so v! is not in fl(f 1 , . . . ,f[,_ 1 ). We also need to show that w' ^ v! but this follows 
from the fact that u ^ w and so w' = N(w) ^ N(u) = v! since N is invertible on U and u G U. 
Since 

\N(U)\ = \U\ > (1 - l/(2k))5n > (1 - l/(2k))6\V'\ 
the proof is complete. □ 

We can now prove the theorem for <5-SGfc configurations. 

Proof of Theorem \5.4\ The proof follows by induction on k (the case k = 1 is given by Theo- 
rem H3]). Suppose k > 1. Suppose that dim(V) > g(S, k). We want to show that there exist k 
independent points vi,...,Vf ! s.t. for at least 1 — 5 fraction of the points w G V we have that 
w is not in fl(vi, . . . ,v k ) and the flat fl(t»i, . . . ,V)-,w) is elementary (i.e. does not contain any 
other point). 

Let k' = g(l, k — 1). By choice of g we have g(5, k) > f(S, k' + 1) with / from Theorem 15.31 
Thus, by Theorem 15.3} we can find k' + 1 independent points vi, . . . , Vk'+i s.t. there is a set 
U C V of size at least (1 — S)n s.t. for every u G U we have that u is not in fl(ui, . . . , vy+i) and 
the (k' + l)-flat fl(ui, . . . , Vk'+i,u) contains only one point, namely u, outside fl(«i, . . . , ffc'+i). 
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We now apply the inductive hypothesis on the set V fl fl(«i, ■ ■ ■ , Wfe'+i) which has dimension 
at least fc' = a(l, k — 1). This gives us k independent points v[, . . . ,v' k that define an elementary 
{k — l)-flat B.(v[, . . . ,vt). (Saying that V is not l-SGfe_x is the same as saying that it contains 
an elementary (k — l)-flat). Joining any of the points u £ U to v[, . . . , v' k gives us an elementary 
fc-flat and so the theorem is proved. □ 



6 Generalizations of the Motzkin-Rabin Theorem 

In this section we prove two variants of the Motzkin-Rabin Theorem. The first is a quantitative 
analog in the spirit of Theorem 14.31 The second is a variant in which the number of colors is 
three (instead of two). 

6.1 A Quantitative Variant 

Definition 6.1 (<5-MR configuration). Let Vi, V 2 be two disjoint finite subsets of C d . Points in 
V\ are of color 1 and points in V 2 are of color 2. A line is called bi- chromatic if it contains at 
least one point from each of the two colors. We say that V\, V 2 are a 5-MR configuration if for 
every i G [2] and for every point p G Vi, the bi-chromatic lines through p contain at least S\Vi\ 
points. 

Theorem 6.2. Let Vi,V 2 C C d be a 5-MR configuration. Then 

dim(V 1 ,V 2 ) < 0(l/<5 4 ). 

Proof. We will call a line passing through exactly two points in V\ (resp. V 2 ) a Vi-ordinary 
(resp. ^-ordinary) line. W.l.o.g. assume 

N<|v 2 |. 

We seperate the proof into two cases: 

Case I is when V 2 is a (<5/2)-SG configuration. Then, by Theorem 14.31 dim(V2) < 0(l/5 2 ). 
If in addition 

dim(Vi) < 13/05/2) 2 

then we are done. Otherwise, by Theorem 14.3^ there exists a point oq £ V\ such that there are 
at least (1 — 5/2)|Vi| Vx-ordinary lines through a . Let ai, . . . , denote the points in V\ that 
belong to these lines with k > (1 — <5/2)|Vi|. We now claim that V% U {ao} spans all the points 
in V\. This will suffice since, in this case, dim(T^2) < 0(l/8 2 ). Let a € V\. Then, since Vi,V 2 is 
a <5-MR configuration, there are at least 5\V\\ points in V\ such that the line through them and 
a contains a point in V 2 . One of these points must be among ai, . . . , a&, say it is ai. Since a is 
in the span of V 2 and a\ and since a\ is in the span of V 2 and ao we are done. 

Case II is when V 2 is not a (5/2)-SG configuration. In this case, there is a point b £ V 2 
such that there are at least (1 — <5/2) | V2 1 ^-ordinary lines through b. From this fact and from 
the 5-MR property, we get that \V\\ > (5/2)\V 2 \ (there are at least (<5/2) | V2 1 ^-ordinary lines 
through b that have an additional point from V\ on them). This implies that the union V\ U V 2 
is a (<5 2 /4)-SG configuration and the result follows by applying Theorem 14.31 □ 
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6.2 A Three Colors Variant 

Definition 6.3 (3MR configuration). Let Vi, V^VJj be three pairwise disjoint finite subsets of 
C d , each of distinct points. We say that Vi, V2, V3 is a 3MR-configuration if every line £ so that 
£ D (Vj. U V2 U V3) has more than one point intersects at least two of the sets V\, V2, V3. 

Theorem 6.4. Let V\, V2, V3 be a 3MR configuration and denote V = V\ U V% U V3. Then 

dim(V) < Oil). 

Proof. Assume w.l.o.g. that V\ is not smaller than V2, V3. Let a = 1/16. There are several cases 
to consider: 

1. V\ is an a-SG configuration. By Theorem 14.31 the dimension of V\ is at most 

di = 0(l/a 2 ). 

Consider the two sets 

Vi = V 2 \span(Fi) and V 3 ' = V 3 \ span(Vi), 
each is a set of distinct points in C d . Assume w.l.o.g. that \V 2 \ > \V S '\. 

1.1. V2 is an a-SG configuration. By Theorem 14.31 the dimension of V 2 ' is at most 

<k = 0(l/a 2 ). 

Fix a point v$ in V3. For every point u 7^ f 3 in V3' the line through v%,v contains a 
point from span(Vi) U V 2 . Therefore, 

dim(V r ) < di + d 2 + 1 < 0(1). 

1.2. K;' is not an a-SG configuration. There is a point V2 in V^' so that for k > | | /2 
of the points v 7^ t>2 in the line through U2,u does not contain any other point 
from V 2 '- If V 2 ' = span(Vi, ^2) then the dimension of V\ U V2 is at most d\ + l and we 
are done as in the previous case. Otherwise, there is a point v' 2 in V 2 \ span^i,^)- 
We claim that in this case \V^\ > k/2. Denote by P2 the k points v 7^ v 2 in V 2 so that 
the line through V2,v does not contain any other point from V 2 . For every v € P2 
there is a point Vi^(v) in Vi U V3 that is on the line through v,V2 (the point V2 is 
fixed). There are two cases to consider. 

The first case is that for at least k/2 of the points v in P2 we have Vi^(v) 6 V3. In 
this case clearly IV3I > k/2. 

The second case is that for at least k/2 of the points v in P2 we have Vi^(v) G V\. 
Fix such a point v G P2 (which is in span(Vi, t^))- The line through v' 2 ,v contains a 
point v' from Vi U V3. The point v' is not in span(Vi), as if it was then v 2 would be 
in span(f,u') C span(Vi,w). Therefore v' is in V3. This also implies that IV3I > k/2. 
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Denote V = Vi U V3. So we can conclude that for every v' in V the special lines 
through v' contain at least |U'|/8 of the points in V\ U V2 U V3. As in the proof of 
Theorem 14.31 we can thus define a family of triples T, each triple of three distinct 
collinear points in V, so that each v' in V' belongs to at least |U'|/8 triples in T and 
each two distinct v', v" in V' belong to at most 6 triples. 

By a slight abuse of notation, we also denote by V the matrix with rows defined 
by the points in V. Let V\ be the submatrix of V with row defined by points in 
span(Vi) n V and V 1 be the submatrix of V with row defined by points in V'. Use the 
triples in T to construct a matrix A so that A ■ V = 0. Let A\ be the submatrix of A 
consisting of the columns that correspond to span(Vi) n V and A' be the submatrix 
of A consisting of the columns that correspond to V . Therefore, A' -V' = —A\ ■ V\ 
which implies 

rank(A' • V') < rank(Ai • V x ) < d x . 

By the above discussion A' is a (3, |U'|/8, 6)-design matrix and thus, by Theorem l3.2l 
has rank at least 

\V'\ -0(1) 

and so 

dim(V0 < 0(1) + di < 0(1). 
We can finally conclude that 

dim(F) < di + dim(y') < O(l). 

2. V\ is not an a-SG configuration. There is a point v\ in V\ so that for at least |Vi|/2 of 
the points v 7^ v\ in V\ the line through v%,v does not contain any other point from V\. 
Assume w.l.o.g. that | V2 1 > 1 1/3 1 - This implies that 

\V 2 \ > |^i |/4. 

2.1. I V3 1 < IV2I/I6. In this case the configuration defined by V\ U V2 is an a-SG configu- 
ration. By Theorem 14.31 the dimension of V\ U V2 is at most 

di, 2 = 0(l/a 2 ). 

Fix a point U3 in V3. For every point t> 7^ v% in V3 the line through ^3,^ contains a 
point from V\ U V2- Therefore, 

dim(V) < di, 2 + l < O(l). 

2.1. IV3I > IV2I/I6. In this case V is an a-SG configuration. By Theorem 14.31 the dimen- 
sion of V is thus at most 0(l/a 2 ). 

□ 
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7 Two- Query Locally Correctable Codes 



We now prove the non-existence of 2-query (linear) locally correctable codes (LCC) over C. We 
start by formally denning locally correctable codes: 

Definition 7.1 (Linear locally correctable code (LCC)). Let F be some field. A (q,<5)-LCC 
over F is a linear subspace C C F m such that there exists a randomized decoding procedure 
D : F m x [to] i — ^ F with the following properties: 

1. For all x G C, for all i G [to] and for all v G F m with w(v) < 5m we have that D (x + v, i) = 
Xi with probability at least 3/4 (the probability is taken only over the internal randomness 
of.D). 

2. For every y G F m and i G [to], the decoder D(y,i) reads at most q positions in y. 
The dimension of an LCC is simply its dimension as a subspace of F m . 

In the above definition we allow the algorithm D to perform operations over the field F. 
Since we do not care about the running time of D we do not discuss issues of representation of 
field elements and efficiency of handling them. (In any case, it turns out that for linear codes 
in the small number of queries and low error case, one can assume w.l.o.g. that the decoder is 
also linear, see Lemma 17.41 below.) 

Our result on locally decodable codes is the following: 

Theorem 7.2 (Restatement of Theorem [3] — non-existence of 2 query LCCs over C). Let C C 
C m be a (2,5)-LCC overC. Then 

dim(C) < 0(l/5 9 ). 

As in Theorem 14.31 a ls° i n this theorem, 5 can be an arbitrary function of to. To make 
the connection between LCCs and SG-configurations explicit, we define the notion of a <5-LCC 
configuration. 

Definition 7.3 (5-LCC Configuration). A list of non-zero points (y±, . . . ,v m ) in C d (not neces- 
sarily distinct) is called a 5-LCC configuration if for every subset A C [to] of size at most 8m 
and for every i G [m], there exist j,k G [to] \ A such that either Vi G {vj,Vk} (in which case Vi 
can be recovered by its own copies) , or Vi , Vj , are three distinct collinear points (in which case 
Vi is recovered by two other coordinates). 

The following lemma shows the connection between these two notions. 

Lemma 7.4. // there exists a (2,5)-LCC of dimension n over C then there exists a 5-LCC 
configuration of dimension at least n — 1 over C. 

To prove the lemma we will use the following definition. 

Definition 7.5 (Generating set). Let C C F m be a subspace. We say that a list of vectors 
V = (v±, . . . ,v m ) in F n is a generating set for C if 

C = {((y,v 1 ),(y,v 2 ),...,(y,v m )) \ y G F n } , 
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where (y, v) is the standard inner product over F. 

Proof of Lemma \ 7.4\ Let V = (v\, . . . ,v m ) be a generating set for C with dim(V) > n — 1. 
We might lose 1 since we defined dim(y) as the dimension of the smallest affine subspace 
containing V. When the local decoder for C reads two positions in a codeword, it is actually 
reading (y,Vj), {y,Vk) for some vector y £ C n (or noisy versions of them). In order to be able 
to recover (y,Vi) from (y,Vj), (y,Vk) with positive probability it must be that Vi £ spanjfj, v^}. 
(If we choose y as Gaussian and v i is not in the span of v j , Vk then even conditioned on the 
values of (y,vj), (y,Vk) the r.v. (y,Vi) takes any specific value with probability zero.) Applying 
an invertible linear transformation on V preserves properties such as one vector being in the 
span of another set. So we can assume w.l.o.g. that the first coordinate in all elements of V is 
non-zero. Scaling each V{ by a non-zero scalar also preserves the properties of spans and so we 
can assume w.l.o.g. that the first coordinate in each Vi is equal to 1. Now, for Vi to be in the 
span of Vj, Vk it must be that either Vi S {vj, v^} or v i is on the line passing through v j, (and 
they are all distinct). Thus, we have a <5-LCC configuration with dimension n — 1. □ 

In view of this lemma, in order to prove Theorem 17.21 it is enough to prove: 
Theorem 7.6. Let V = (vi, . . . ,v m ) G (C d ) m be a 5-LCC configuration. Then 

dim(V) < 0{l/5 9 ). 

7. 1 Proof of Theorem PfTHl 

Let V = (y±, . . . , v m ) be the list of m points in C d . The main difficulty in proving the theorem 
is that some of these points may be the same. That is, two points Vi, Vj can actually correspond 
to the same vector in C^. In this case we say that Vi,Vj are copies of each other. Otherwise, 
we say that Vi, Vj are distinct. If v is a point in the list V, we let the multiplicity of v, denoted 
M(v), be the number of times that (a copy of) v occurs in V . 

We note that while repetitions make the proof of Theorem 17.61 more complicated, we do not 
know if they actually help in constructing LCCs with better parameters. Our proof will proceed 
in an iterative way, at each step identifying a sufficiently large sublist with small dimension and 
removing it. The key step will be the following theorem: 

Theorem 7.7. There exists an integer K\ > s.t. the following holds. Let V = (v\, . . . ,v m ) £ 
(C d ) m be a 5-LCC configuration. Then there exists a sublist V C V of size at least 5 3 m/Ki and 
dimension at most Ki/S 6 . 

Proof. If there exists a point v E V with multiplicity larger that 5m/ 10 then the theorem is 
true by taking V' to be all copies of this point. This avoids the case where a point is recovered 
mostly by its own copies. For the rest of the proof we can, thus, assume the following. 

Fact 7.8. For all v € V and for every sublist A ofV of size at most 5m/2 there is a collinear 
triple containing v such that the other two points in the triple are not in A (and are distinct 
from v). 
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We will describe a (probabilistic) construction of a family of collinear triples and build a 
design matrix from it. We call a triple of points in V good if it contains three distinct collinear 
points. We define a family T of good triples as follows: For every line i that has at least three 
distinct points in V we will define (randomly) a family Tg of good triples (later we will fix the 
randomness). The family T will be the union of all these sets. 

Remark 7.9. The construction of T we present is probabilistic. It is possible to construct T 
explicitly and achieve similar properties. We choose to present the probabilistic construction as 
it is simpler and less technical. 

Let £ be such a line with r points on it (counting multiplicities). Denote by V(£) the sublist 
of V containing all points that lie on I. We first take the family F of triples on [r] given by 
Lemma 14.61 and then pick a random one-to-one mapping p : [r] i— > V(£). For a triple t in F we 
denote by p(t) the triple of points in V(i) that is the image of t under p. We take Tg to be the 
set of all triples p(t) with t 6 F and such that p(t) is good (i.e., it 'hits' three distinct points). 

Intuitively, we will have many good triples on a line (in expectation) if there are no two 
points whose copies cover most of the line (then the probability of hitting three distinct points 
is small). We will later show that this cannot happen on too many lines. 

The next proposition shows that there is a way to fix the randomness so that T contains a 
quadratic number of triples. 

Proposition 7.10. The expectation of \T\ is at least am 2 with a = (J/15) 3 . 

We will prove this proposition later in Section 17.21 and will continue now with the proof of 
the theorem. 

Fix T to be a family of triples that has size at least the expectation of \T\. By construction 
and Lemma |4.6| the family T contains only good triples and each pair of points appears in at 
most 6 different triples (since every two distinct points define a single line and two non-distinct 
points never appear in a triple together). The family T thus defines a 3-regular hypergraph with 
vertex set [m] and at least am 2 edges and of co-degree at most 6. Lemma 14.91 thus implies that 
there is a sublist V of V of size at least 



with the following property: Let T' be the subfamily of T that V induces. Every v 1 in V is 
contained in at least am/2 triples in T'. 

By a slight abuse of notation, we also denote by V the m' x d matrix with rows defined by 
the points in V' (including repetitions). We now use the triples in T' to construct a matrix A' 
so that A' ■ V = 0. By the above discussion A' is a (3, am/2, 6)-design matrix and thus, by 
Theorem 13.21 has rank at least 



V'\ = m! > am/12 > (5/45) 3 m 




and so 



dim(y') < (18/a) 2 < (60/<5) 6 



as was required. 



□ 
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The next proposition shows how the above theorem can be used repeatedly on a given LCC. 

Proposition 7.11. There exist an integer K 2 > s.t. the following holds: Let V = (v\, . . . , v m ) £ 
(C d ) m be a 5-LCC configuration and let U,W be a partition ofV into two disjoint sublists such 
that W n span(U) = 0. Then there exists a new partition of V to two sublists U' and W such 
that W n span(U') = and such that 

1. \U'\ > \U\ + 5 3 m/K 2 , and 

2. dim(U') < dim(U) + K 2 /5 6 . 

Proof. First, we can assume that all points in W have multiplicity at most 5m/2 (otherwise we 
can add one point from W with high multiplicity to U to get U'). Thus, for all points v and 
all sublists A of size at most 5m/ 2 there is a collinear triple of three distinct points containing 
v and two other points outside A. Again, this is to avoid points that are recovered mostly by 
copies of themselves. 

For a point w £ W we define three disjoint sublists of points U(w), P\(w) and P2(w). The 
first list, U(w), will be the list of all points in U that are on special lines through w (that is, 
lines containing w and at least two other distinct points). Notice that, since w span(C/), each 
line through w can contain at most one point from U. The second list, Pi(w), will be the list of 
points in W \{w} that are on a line containing w and a point from U. The third list, P2(w), will 
be of all other points on special lines through w (that is, on special lines that do not intersect 
U). These three lists are indeed disjoints, since w is the only common point between two lines 
passing through it. By the above discussion we have that |Pi(w)| + | -F2 ('"-') I ^ Sm/2 for all 
w G W (since removing these two lists destroys all collinear triples with w). We now separate 
the proof into two cases: 



Case I : There exists w G W with |Pi(io)| > 5m/ 4. In this case we can simply take U' to 
be the points in V that are also in the span of {w} U U. This new U' will include all points in 
Pi(w) and so will grow by at least 5m/ 4 points. Its dimension will grow by at most one and so 
we are done. 



Case II : For all w G W, \P2(w)\ > 5m /4. Denote m' = \W\. In this case W itself is a 
<5'-LCC configuration with 

s>= 5rn 
8m' 

Applying Theorem 17.71 we get a sublist U" C W of size at least 

^ > wf ■ £ 

Ki Ki 

and dimension at most 

w- Kl{s,5f - 

We can thus take V to be the points in V that are in the span of U U U" and the proposition 
is proved. □ 
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Proof of Theorem 7.6 We apply Proposition 17. 1 11 on V, starting with the partition U = 0, W = 
V and ending when U = V, W = 0. We can apply the proposition at most K2/& 3 times and in 
each step add at most -ft^/^ 6 to the dimension of A (which is initially zero). Therefore, the final 
list U = V will have dimension at most 0(l/5 9 ). □ 



7.2 Proof of Proposition 17.101 

Order the points in V so that all copies of the same point are consecutive and so that M(vi) < 
M{vj) whenever i < j. Let S C V be the sublist containing the first <5m/10 points in this 
ordering (we may be splitting the copies of a single point in the middle but this is fine). We will 
use the following simple fact later on: 

Fact 7.12. IfveS and M(v') < M(v) then v' G S. 

For a point v £ V we denote by T(v) the set of (ordered) triples in T containing v and for a 
line £ by Tg{v) the set of (ordered) triples in Tg containing v. Recall that these are all random 
variables determined by the choice of the mappings p for each line £. 

The proposition will follow by the following lemma. 
Lemma 7.13. Let v G S. Then the expectation of\T(v)\ is at least (5/10) 2 m. 

The lemma completes the proof of the proposition: summing over all points in S we get 



ENTll > K 



(l/3)J2\T(v)\ 



(each triple is counted at most three times) 



v&V 

> (l/3)2>[|T(r;)|] 

> (1/3) • (5m/W) • ((5/W) 2 m) > (5/15) 3 m 2 . 

Proof of Lemma \7.1S[ Denote by L(v) the set of all special lines through v. To prove the lemma 
we will identify a subfamily L'(v) of L(v) that contributes many triples to T(v). To do so, we 
need the following definitions. For a set 7 C C d denote by ^(7) the set of distinct points in V 
that are in 7. Denote M{^) = J2veP(-y) M( v )- Denote by P(S) the set of distinct points not in 
S. 

Definition 7.14 (Degenerate line). Let £ G L(v). We say that £ G L[v) is degenerate if either 

1. The size of P(£) D P(S) is at most one. That is, £ contains at most one distinct point 
outside S. Or, 

2. There exists a point V£ G P(£), distinct from v, such that M(vg) > (1 — 5/10)M(£). 

A degenerate line satisfying the first (second) property above will be called a degenerate line of 
the first (second) kind. 
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Define U(v ) as the set of line £ in L(v) that are not degenerate. We will continue by proving 
two claims. The first claim shows that every line in L'(v) contributes many triples in expectation 
to T(v). 

Claim 7.15. For every £ G L'(v) we have E[\T e (v)\] > 5M(£)/10. 

Proof. Denote r = |M(^)|. The family of triples Ti is obtained by taking a family of r(r — 1) 
triples F on [r] (obtained from Lemma I4.6P and mapping it randomly to £, omitting all triples 
that are not good (those that do not have three distinct points). For each triple t S F the 
probability that pit) will be in Ti(v) can be lower bounded by 



The factor of 3/r comes from the probability that one of the three entries in t maps to v 
(these are disjoint events so we can sum their probabilities). 

The next factor, 2r/(3(r — 1)), comes from the probability that the second entry in t (in 
some fixed order) maps to a point distinct from v. Indeed since \P{£) n P{S)\ > 2 and using 
Fact 17.121 we know that there are at least two distinct points v',v" on £ with M(v') > M{v) and 
M(v") > v. Since M(v) + M(v') + M(v") < r, we get that M(v) < r/3, and so there are at 
least 2r/3 "good" places for the second point to map to. 

The last factor, 5/20, comes from the probability that the third element of the triple will 
map to a point distinct from the first two. The bound of 5/20 will follow from the fact that £ 
does not satisfy the second property in the definition of a degenerate line. To see why, let v 2 be 
the image of the second entry in t. Since £ is not degenerate, r' = r — M[v 2 ) > 5r/\0. Since 
\P(£) n P(S)\ > 2, there is a point v' in P(S) not in {v, v 2 }, and hence, by Fact I7TT21 M(v) < 
M{v'). Since M(v) + M(v') < r' , we get that M(v) < r'/2. Thus r' - M(v) > r'/2 > 5r/20. 
But r 1 — M(v) is exactly the number of 'good' places that the third entry can map to that are 
from v and 

Using linearity of expectation we can conclude 



3 2r 5 5 



r ' 3(r - 1) ' 20 10(r - 1) 



n\Te{v)\] >r(r-l) • 
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5r/10. 



10(r - 1) 



□ 



The second claim shows that there are many points on lines in L'(v). 
Claim 7.16. With the above notations, we have: 




Proof. Assume in contradiction that 
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Let A' denote the sub-list of V containing all points that lie on lines in L'{v) so that |A'| < 
Sm/10. We will derive a contradiction by finding a small sublist A of V (containing A' and two 
other small sub-lists) that would violate Fact 17.81 That is, if we remove A from V, we destroy 
all collinear triples containing v. 

Let I be a degenerate line of the second kind. Then there is a point vg on it that is distinct 
from v and has multiplicity at least (1 — <5/10)ilf {£). For every such line let Ag denote the sublist 
of V containing all of the at most (S/10)M(£) — M(v) points on this line that are distinct from 
both v and vg. Let A2 denote the union of these lists Ag over all degenerate lines of the second 
kind. We now have that |Aa| < Sm/10 since ^2i(M(£) — M(v)) < m and in each line £ we have 

|A*| < (5/W)M{£) - M(v) < (5/W)(M(£) - M(v)). 

Notice that, removing the points in A2 destroys all collinear triples on degenerate lines of the 
second kind. 

Finally, let A5 denote the sublist of V containing all points that have a copy in S. Thus 
As contains the list S (of at most Sm/10 elements), plus all of the at most Sm/10 copies of 
the last point in S, meaning that |A<j| < Sm/5. Removing A5 destroys all collinear triples on 
degenerate lines of the first kind. Define A as the union of the three sublists A', A2 and A5. 
From the above we have that removing A from V destroys all collinear triples containing V and 
that |A| < 4(<yi0)m < Sm/2. This contradicts FactEEJ □ 

Combining the two claims we get that for all v € S, 

E[|T(v)|]> Yl E i\ T e( v )\}> Yl ^M^)/W>{5/10)-{5m/10) = {5/10) 2 m. 

£eL'(v) i€L'(v) 

This completes the proof of Lemma 17.131 □ 



8 Extensions to Other Fields 

In this section we show that our results can be extended from the complex field to fields of 
characteristic zero, and even to fields with very large positive characteristic. The argument is 
quite generic and relies on Hilbert's Nullstellensatz. 

Definition 8.1 (T-matrix). Let m, n be integers and let T C [m] x [n]. We call an m x n matrix 
A a T-matrix if all entries of A with indices in T are non-zero and all entries with indices outside 
T are zero. 

Theorem 8.2 (Effective Hilbert's Nullstellensatz [K0I88]). Let gi, . . . ,g s £ Z[yi, ■■■ ,yt] be de- 
gree d polynomials with coefficients in {0,1} and let 

Z±{yeC t \g i (y) = OViE [s]}. 

Suppose h G Zjzi, . . . , Zt] is another polynomial with coefficients in {0, 1} which vanishes on Z. 
Then there exist positive integers p, q and polynomials fi, . . . , f s E Z[yi, . . . , such that 

s 

^2fi -9i = p-h q . 

i=i 
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Furthermore, one can bound p and the maximal absolute value of the coefficients of the fi 's by 
an explicit function Ho(d,t,s). 

Theorem 8.3. Let m,n,r be integers and let T C [m] x [n]. Suppose that all complex T- 
matrices have rank at least r. Let ¥ be a field of either characteristic zero or of finite large 
enough characteristic p > Po(n,m), where Pq, is some explicit function of n and m. Then, the 
rank of all T -matrices over F is at least r. 

Proof. Let g\ , . . . , g s G C[{xij \ i £ [m],j E [n]}] be the determinants of all r x r sub-matrices of 
an m x n matrix of variables X = (xij). The statement "all T-matrices have rank at least r" can 
be phrased as "if x^ = for all (i,j) T and gk(X) = for all k £ [s] then Wu j\ & T x ii = 
That is, if all entries outside T are zero and X has rank smaller than r then it must have at 
least one zero entry also inside T. From Nullstellensatz we know that there are integers a, A > 
and polynomials f\, . . . , f s and hij, (i,j) T, with integer coefficients such that 

a ■ ( II x v I = E x *3 ■ M*) + E /*W • 9i{X). (2) 
\(vi)eT / {i,mr fc=i 

This identity implies the high rank of T-matrices also over any field F in which Since we 

have a bound on a in terms of n and m the result follows. □ 



9 Discussion and Open Problems 

Our rank bound for design matrices has a dependence on q, the number of non-zeros in each row. 
Can this dependency be removed? This might be possible since a bound on q follows indirectly 
from specifying the bound on t, the sizes of the intersections. Removing this dependency might 
also enable us to argue about square matrices. Our results so far are interesting only in the 
range of parameters where the number of rows is much larger than the number of columns. 

With respect to Sylvester- Gallai configurations, the most obvious open problem (discussed 
in the introduction) is to close the gap between our bound of 0(l/8 2 ) on the dimension of 5-SG 
configuration and the trivial lower bound of £1(1/5) obtained by a simple partition of the points 
into 1/5 lines. 

Another interesting direction is to explore further the connection between design-matrices 
and LCCs. The most natural way to construct an LCC is by starting with a low-rank design 
matrix and then defining the code by taking the matrix to be its parity-check matrix. Call such 
codes design-LCCs. Our result on the rank of design matrices shows, essentially, that design- 
LCCs over the complex numbers cannot have good parameters in general (even for large query 
complexity). It is natural to ask whether there could exist LCCs that do not originate from 
designs. Or, more specifically, whether any LCC defines another LCC (with similar parameters) 
which is a design-LCC. This question was already raised in [BIW07]. Answering this question 
over the complex numbers will, using our results, give bounds for general LCCs. It is not out 
of the question to hope for bounds on LCCs with query complexity as large as polynomial in m 
(the encoding length). This would be enough to derive new results on rigidity via the connection 
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made in |DvilO| . In particular, our results on design matrices still give meaningful bounds (on 
design-LCCs) in this range of parameters. 

More formally, our results suggest a bound of roughly poly(g, 1/5) on the dimension of (q, 5)- 
LCCs that arise from designs. A strong from of a conjecture from |DvilO| says that an LCC 
CcF" with q = n e queries and error 5 = n~ e , for some constant e > 0, cannot have dimension 
0.99 • n. This conjecture, if true, would lead to new results on rigidity. Thus, showing that any 
LCC defines a design (up to some polynomial loss of parameters), combined with our results, 
would lead to new results on rigidity. 
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